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ABSTRACT  INTRODUCTION 


In  considering  the  problem  of  formation  control  in  the 
deployment  of  intelligent  munitions,  it  would  be 
highly  desirable,  both  from  a  mission  and  a  cost 
perspective,  to  limit  the  information  that  is 
transmitted  between  vehicles  in  formation.  However, 
the  lack  of  information  regarding  the  state  of  motion 
of  neighboring  vehicles  can  lead  to  degraded 
performance  and  even  instability.  This  paper  presents 
an  adaptive  output  feedback  approach  for  addressing 
this  problem.  We  design  adaptive  formation 
controllers  that  allow  each  vehicle  in  formation  to 
maintain  separation  and  relative  orientation  with 
respect  to  neighboring  vehicles,  while  avoiding 
obstacles.  The  method  works  by  enabling  each 
vehicle  in  the  formation  to  adaptively  correct  for  the 
effect  that  the  motions  of  neighboring  vehicles  have 
when  regulating  relative  variables  like  range  and  line 
of  sight.  It  is  assumed  that  estimates  of  these 
variables  can  be  derived  using  passive,  vision-based 
sensors.  The  need  for  explicit  communication  to 
maintain  formation  is  minimized  and  the  resulting 
controller  solution  is  decentralized.  We  implement  a 
reactive  obstacle  avoidance  controller  to  navigate  in 
an  environment  with  obstacles.  The  formation 
controller  and  obstacle  avoidance  controller  are 
outer-loop  controllers  whose  outputs  are  speed  and 
heading  commands.  These  commands  are  blended 
together  to  generate  composite  speed  and  heading 
commands  that  are  inputs  to  the  inner-loop  controller. 
The  weights  used  for  blending  the  commands  depend 
upon  the  priority  of  the  task  at  hand.  We  illustrate  the 
method  with  an  example  involving  a  team  of  three 
aircraft  keeping  formation  in  the  presence  of 
obstacles. 


As  demonstrated  in  recent  conflicts,  unmanned  aerial 
vehicles  (UAVs)  are  becoming  an  important 
component  of  our  military  force  structure.  UAVs, 
operating  in  close  proximity  to  enemy  forces,  provide 
real-time  information  difficult  to  obtain  from  other 
sources,  without  risk  to  human  pilots.  Technology 
demonstration  programs  such  as  UCAV  illustrate  the 
trend  toward  development  of  UAVs  that  will 
dominate  enemy  airspace  through  maintenance  of  a 
continuous  presence  over  the  battlefield.  Among  the 
weapons  employed  by  these  UAVs  will  be  flocks  of 
cooperative  miniature  or  micro  autonomous  vehicles 
(MAVs)  operating  in  close  proximity  to  terrain  or 
structures  that  will  gather  information  on  enemy 
movements  and,  under  human  supervision,  seek  out, 
identify,  and  attack  targets  of  opportunity.  Of 
course,  reduction  in  size  must  be  traded  against  a 
concomitant  reduction  in  loiter  time  due  to  reduced 
fuel  or  battery  capacity.  In  addition  to  advancements 
in  propulsion  technologies,  concepts  for  exploitation 
of  unsteady  aerodynamics  to  reduce  drag  are  being 
explored.  In  large  groups  of  MAVs  or  small  UAVs, 
even  small  percentage  reductions  in  drag  will  offer 
significant  increased  payoffs  in  the  ability  to  maintain 
persistent  coverage  of  a  large  area.  One  concept, 
well  known  to  bicyclists,  race  car  drivers,  and  pilots 
and  exploited  by  swimming  and  flying  animals,  is  the 
benefit  of  operating  in  the  wake  of  another  vehicle  (or 
organism).  Therefore  maintaining  a  formation  while 
at  the  same  time  executing  searches  in  a  congested 
environment  will  be  a  primary  requirement.  Stealth 
like  operations  will  also  be  important,  implying  the 
need  to  maintain  autonomy  and  to  minimize 
communication.  Maintaining  a  formation  is  also 
important  from  this  perspective  so  that  passive  (vision 
based)  sensing  can  be  used  to  ascertain  the  locations 


Graduate  Research  Assistant,  gte334x@prism.gatech.edu 
1  Professor.  Fellow  AIAA.  anthony.calise@ae.gatech.edu 
1  Chief,  Autonomous  Control  Team.  Senior  Member  AIAA.  evers@eglin.af.mil 


1 

American  Institute  of  Aeronautics  and  Astronautics 


Copyright  ©  2003  by  Georgia  Institute  of  Technology.  Published  by  the  American  Institute  of  Aeronautics  and  Astronautics,  Inc.,  with  permission. 


Report  Documentation  Page 


Form  Approved 
OMB  No.  0704-0188 


Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 
VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 


1.  REPORT  DATE 

AUG  2003 


2.  REPORT  TYPE 


3.  DATES  COVERED 

00-00-2003  to  00-00-2003 


4.  TITLE  AND  SUBTITLE 

An  Adaptive  Approach  to  Vision-Based  Formation  Control 


6.  AUTHOR(S) 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Georgia  Institute  of  Technology, School  of  Aerospace 
Engineering, Atlanta, GA, 30332-0150 


5a.  CONTRACT  NUMBER 


5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

5d.  PROJECT  NUMBER 


5e.  TASK  NUMBER 


5f.  WORK  UNIT  NUMBER 

8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 


10.  SPONSOR/MONITOR'S  ACRONYM(S) 


11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 


12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

AIAA  Guidance,  Navigation,  and  Control  Conference  and  Exhibit  11-14  Aug  2003,  Austin,  TX.  U.S. 
Government  or  Federal  Rights  License 


14.  ABSTRACT 

see  report 


15.  SUBJECT  TERMS 


16.  SECURITY  CLASSIFICATION  OF: 


a.  REPORT 

unclassified 


b.  ABSTRACT 

unclassified 


c.  THIS  PAGE 

unclassified 


17.  LIMITATION  OF 

18.  NUMBER 

ABSTRACT 

OF  PAGES 

Same  as 

13 

Report  (SAR) 

19a.  NAME  OF 
RESPONSIBLE  PERSON 


Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


and  behaviors  of  cooperating  MAVs/UAVs.  Various 
approaches  to  close-coupled  formation  flight  control 
for  drag  reduction  include  PID  control,1  linear 
quadratic  regulator  (LQR)  based  decentralized 
control2  and  an  adaptive  approach1.  In  this  paper,  we 
do  not  focus  on  the  aerodynamic  interaction  effects  of 
formation  flight.  Instead  we  focus  on  the  coupling  of 
the  aircraft  kinematics  due  to  their  measurement  and 
control  strategies.  4  We  consider  formation  control 
within  the  context  of  coordinated  group  motion. 

Although  imperfectly  understood,  flocking  behavior 
of  birds,  schooling  behavior  of  fish,  and  even  studies 
of  swarming  insects  have  provided  inspiration  for 
concepts  of  coordinated  multi-vehicle  operation.  5 
Existing  works  on  coordinated  group  motion  include 
a  distributed  behavioral  approach  to  synthesizing  the 
flocking  motion  of  boids6  (bird  and  fish-like  objects). 
This  approach  assumes  a  flock  is  the  result  of  the 
interaction  between  the  behaviors  of  individual  boids 
(used  here  to  refer  to  individual  autonomous  agents 
operating  in  a  coordinated  manner).  It  was  similarly 
shown  in  Ref.  7  that  coordinated  multi-robot  motion 
could  be  constructed  by  using  a  small  basis  set  of 
behaviors.  A  control-theoretic  approach  to  formation 
control  is  given  in  Ref.  8.  The  control  laws  are 
derived  from  input-output  feedback  linearization 
theory.  9  The  control  laws  allow  each  follower  vehicle 
in  the  formation  to  regulate  range  and  relative 
orientation  with  respect  to  one  leader  vehicle,  or 
range  with  respect  to  two  leader  vehicles,  or  range 
with  respect  to  a  leader  vehicle  while  maintaining 
safe  distance  from  obstacles.  Switching  between  the 
control  laws  leads  to  changes  in  formation  shape. 
Related  work  on  formation  control  includes 
assignment  of  feasible  formations10  and  moving  into 
formation.  11 

Most  of  the  approaches  for  formation  control  assume 
that  a  leader  (neighbor)  vehicle’s  state  of  motion  is 
known  at  least  partially  to  the  follower  (other 
neighboring)  vehicles.  Ref.  3  assumes  the  leader 
vehicle’s  inertial  positions  are  available  to  the 
follower  vehicles.  Ref.  8  assumes  that  the  leader 
vehicle’s  velocities  and  the  relative  orientations  with 
respect  to  the  follower  vehicles  are  known  to  the 
follower  vehicles,  either  by  communication  or  by 
estimation.  Our  approach  treats  these  quantities  as 
modeling  uncertainties,  whose  effect  on  output 
regulation  is  to  be  canceled  by  the  output  of  an  online 
adaptive  neural  network  (NN).  As  a  result,  each 
vehicle  can  regulate  both  the  range  and  relative 
orientation  to  a  leader  and/or  neighboring  vehicle 
without  knowing  the  state  and  control  policy  of  that 


vehicle.  The  controller  solution  for  each  vehicle  is 
completely  decentralized.  It  is  assumed  that  each 
vehicle  can  measure  its  own  speed,  heading,  line-of- 
sight  (LOS)  range  and  angle  to  other  vehicles.  Vision- 
based  sensors  have  been  successfully  implemented  on 
ground  robots  for  autonomous  navigation.8  12  Recent 
work  has  also  been  done  on  implementing  vision- 
based  navigation  for  unmanned  aerial  vehicles.  13 

The  controller  architecture  in  our  approach  is  based 
on  dynamic  model  inversion.  Each  vehicle  in 
formation  makes  an  assumption  that  the  neighboring 
vehicles  are  stationary.  This  assumption  leads  to 
inversion  errors  that  are  approximated  online  by  a 
NN.  The  theory  is  based  on  an  error  observer 
approach  to  adaptive  output  feedback  control  of 
uncertain,  MIMO  systems.  14  This  approach  is 
adaptive  to  both  parametric  uncertainty  and 
unmodeled  dynamics.  The  need  for  this  approach 
stems  from  the  fact  that,  while  the  relative  degree  of 
the  regulated  LOS  variables  is  known,  the  degree  of 
the  plant  dynamics  is  unknown.  This  is  due  to  the  fact 
that  we  seek  a  decentralized  solution,  and  ignore  the 
fact  that  the  vehicle  dynamics  are  coupled  through  the 
regulated  LOS  variables.  It  is  also  due  to  the  fact  that 
we  are  ignoring  the  speed  and  heading  dynamics,  and 
employing  these  variables  as  control  variables. 

The  method  of  Pseudo-Control  Hedging  (PCH) 15,16  is 
used  to  protect  the  adaptive  process  from  actuator 
limits.  It  is  also  used  to  protect  the  adaptive  process 
during  periods  when  it  is  not  in  complete  control  of 
the  vehicle.  This  can  occur  when  priority  is 
temporarily  given  to  obstacle  avoidance,  and  when 
LOS  separations  from  more  than  one  aircraft  are  to  be 
simultaneously  controlled. 

There  are  numerous  approaches  to  static  obstacle 
avoidance.  A  popular  approach  is  the  Artificial 
Potential  Lield  Approach.  17  Other  approaches  include 
Motion  Planning18  and  ‘Steer  Towards  Silhouette 
Edge’.  19  In  this  paper  we  describe  and  implement  the 
last  approach,  because  this  approach  is  most  suitable 
for  application  with  vision-based  sensors. 

The  organization  of  the  paper  is  as  follows.  The  next 
section  summarizes  the  theory  for  the  error  observer 
approach  and  states  the  problem  formulation  for 
decentralized  formation  control.  Next,  we  develop  the 
individual  vehicle  model  and  the  relative  kinematics 
between  two  vehicles  in  a  formation.  The  next  section 
describes  the  inverting  control  design  for  formation 
control.  Lollowing  this,  we  describe  the  static 
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obstacle  avoidance  controller.  The  control  strategy 
obtained  by  combining  the  outputs  of  the  two 
controllers  through  a  blending  mechanism  enables  the 
vehicles  to  navigate  as  a  group  in  an  environment 
with  obstacles.  We  then  present  and  discuss 
simulation  results  that  illustrate  our  approach. 

ADAPTIVE  OUTPUT  FEEDBACK  APPROACH 

Consider  the  observable  nonlinear  system  described 
by 

X  =  fix,  u) 

n  (1) 

y  =  gw 

where  xe  c  9t"  are  the  states  of  the  system, 
u,ye  91"’  are  the  controls  and  regulated  output 
variables  respectively,  and  /(■,■),  g(-)  are  uncertain 
functions.  Moreover  n  need  not  be  known. 

Assumption  1  The  system  in  (1)  satisfies  the 
condition  for  output  feedback  linearizability  with 

vector  relative  degree  [r, ,  r2 , . . . ,  rm  \r , 

,  20 

r  =  r1+r2+...rm  <n. 

Then  there  exists  a  mapping  that  transforms  the 
system  into  the  so-called  normal  form: 

Z  =  f'{H,z) 

■  (2) 

Z?  =hM,X,u) 

i  =  1.2,  •  •  • ,  m, 

where  ht  (g,  X,  u )  =  ht  (x,  u ) ,  where  |  . . .  %Tm  f , 

and  X  are  the  states  associated  with 
the  internal  dynamics.  Note  that  <f/+1  is  simply  the 
j‘h  time  derivative  of  y. . 

Assumption  2  The  zero  dynamics  are  asymptotically 
stable. 

The  objective  is  to  design  an  output  feedback  control 
law  that  causes  yl  (t)  to  track  a  smooth  bounded 
reference  trajectory  yl:l  (l )  with  bounded  tracking 
error. 


Controller  Design  and  Tracking  Error  Dynamics 

Feedback  linearization  is  achieved  by  introducing  the 
following  inverse 

u  =  h~l{y,v)  (3) 

where 

V  =  h(y,u)  (4) 

is  the  pseudo-control  signal.  The  pseudo-control 

signal  h{y,u)=  \\  (y,u),...,  hm  ( y,u  >r  represents  an 
invertible  approximation  to 

h(x,  u)=  [/z1  (x,  u ), . . . ,  hm  (x,  u  )]7  in  (2),  which  is 
limited  to  using  only  the  available  measurements  and 
control  signal.  If  outputs  other  than  the  regulated 
output  are  available  for  feedback,  they  may  also  be 
used  in  (3)  to  form  the  approximate  inverse. 

Thus  the  system  dynamics,  as  far  as  the  regulated 
output  variable  is  concerned,  is  given  by, 

yr  =  v  +  A  (5) 

where 

A(£  XX)  =  h[%,  x,  h  {y ,  v))-  h[y,  ft  ~l  (y ,  v))  (6) 

is  the  inversion  error  that  results  from  the  use  of  (3)  in 
place  of  an  exact  state  feedback  inverse.  The  pseudo¬ 
control  is  chosen  to  have  the  form 

v  =  yrc+vic-vad  (7) 

where  y’c  are  generated  by  stable  reference  models 
that  define  the  desired  closed-loop  behavior,  Vdc  is 
the  output  of  a  dynamic  compensator  designed  to 
stabilize  the  linearized  error  dynamics,  and  vad  is  the 
adaptive  component. 

From  (5)  and  (7),  the  error  dynamics  are  given  as, 

yr  =  yrc~yr  =-ydc+  K,  - v')  (8) 

From  (6)  and  (7)  it  is  seen  that  A  depends  on  Vad 
through  v ,  and  ( 8)  shows  that  vad  has  to  be  designed 
to  cancel  A .  Therefore  the  following  assumption  is 
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introduced  to  guarantee  existence  and  uniqueness  of  a  with  nl  >  n.  Since  n  is  unknown,  a  sufficient 

solution  for  vad  .  number  of  delayed  signals  are  required. 


Assumption  3  The  map  V all  h»A  is  a  contraction 
over  the  entire  input  domain  of  interest.  It  can  be 
shown  that  this  assumption  leads  to  the  following 
conditions21 


ii) 


i)  sgn 
dh. 


du. 


dhi 

dll , 


1 

>  — 
2 


'  dh, ' 

=  sgn 

/ 

du,- 

v  '  y 

dll- 


dll: 


,  i  =  1,2,  ...,m  . 


>  0 ,  i  =  1,2, . . . ,  m  . 


The  input-output  map  of  a  SHL  NN  is  given  by 

vad=WTa{vTx)  (11) 

where  a  is  the  so-called  squashing  function.  The  NN 
is  trained  online  with  the  adaptive  law 

W  =  -rw  [2 (cr(y  r3f)-  a  {vTx)/ tx)etPB  +  kw\ 
y  =  -rv  [2  xetpbwtcj{vtx)+  kv] 


The  first  condition  requires  that  the  sign  of  the 
control  effectiveness  is  modeled  correctly  and  the 
second  places  a  lower  bound  on  the  estimate  of  the 
control  effectiveness. 

Error  Observer 


It  can  be  shown  that  the  error  dynamics  in  (8)  can  be 
written  as 


/ 

where  <7  (z)  =  diag 

V 


dcy 

dz, 


\ 

/ 


,  P  is  the  positive  definite 


solution  to  the  Lyapunov  equation 
A1  P  +  PA  +(2  =  0,  with  (2>0,  and  rw  and  rv.  are 
the  adaptation  gains.  It  has  been  shown  that  the 
adaptive  law  in  eq.  (12)  guarantees  (subject  to  upper 
and  lower  bounds  on  the  adaptation  gains)  that  all 
error  signals  and  the  NN  weights  are  uniformly 
ultimately  bounded.  14 


E  =  AE  +  B[vad-  A] 


(9) 


Pseudo-Control  Hedging  (PCH) 


where  the  elements  of  E  are  made  up  of  yj  and  its 
derivatives  up  to  order  (r,  - 1)  and  the  dynamic 
compensator  states.  An  error  observer  is  designed 
based  on  this  equation14,  which  results  in  error 
estimates  E  that  are  used  in  the  adaptive  update  law 
given  below. 

Approximation  of  the  Inversion  Error 

The  inversion  error  A  can  be  approximated  to  any 
desired  degree  of  accuracy  by  using  a  Single  Hidden 
Layer  Neural  Network  (SHL  NN)  with  sufficient 
number  of  hidden  layer  neurons,  and  having  the 
following  input  vector,  22  23 

x{t)  =  [1  vj (0  yTd  (f)f  (10) 

where 

v]  (0  =  M4  v(t-d). . v{t - (n,  -  ])d)]' , 

yl (0  =  [>'(0>  y{t-d),...,y(t- (n]  - 1  )d )f 


PCH  is  introduced  to  protect  the  adaptive  law  from 
effects  due  to  actuator  limits  (such  as  rate  and 
position  limits),  time  delays  and  unmodeled  actuator 
dynamics.  1516  The  main  idea  behind  PCH 
methodology  is  to  modify  the  reference  command, 
yc ,  in  order  to  prevent  the  adaptive  element  from 
adapting  to  these  actuator  characteristics.  This  is 
commonly  done  by  generating  the  command  using  a 
reference  model  for  the  desired  response.  The 
reference  model  is  ‘hedged’  by  an  amount  equal  to 
the  difference  between  the  commanded  and  an 
estimate  for  the  achieved  pseudo-control.  To  compute 
this  difference,  a  measurement  or  estimate  of  the 
actuator  position  u  is  required.  The  pseudo-control 
hedge  signal  is  given  by, 

v,H  =  kyy^cmdi  rkiy^il 1  =  U  >•••>'»  ■  (13) 

The  PCH  signal  is  then  introduced  as  an  additional 
input  to  the  reference  model.  If  the  reference  model 
update  without  PCH  is  of  the  form, 

yrci  =  hnn,  (^C,  .  yci  .  •••>  y'C'Vcmdi  ). 
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(18) 


where  ycmd.  is  the  external  command  signal,  then  the 
reference  model  update  with  PCH  is  set  to 

=  hnn ,  [yCi .  yCi  >  •  •  •  >  yV  >  •  (14) 

The  instantaneous  output  of  the  reference  model  used 
to  construct  the  pseudo-control  signal  remains 
unchanged  and  is  given  by 


Z{r)  =  gr[xi,XjUj,Uj) 

To  arrive  at  a  decentralized  control  solution,  the 
following  approximation  is  employed  by  the  i'h 
vehicle 


(19) 


Vn,n  =  hrm  (^c-  .  yci  >•••>  yV’  )  (15) 

The  block  diagram  of  the  MRAC  controller 
architecture  with  PCH  and  error  observer  is  given 
below. 


Equation  (19)  forms  the  basis  for  an  inverting  control 
design  in  which  the  inversion  error  is 


A,  =  Sr  (*;  >  *  j,Ui  ’  « j)~  8  n  U  >  Ui  ) 


(20) 


Figure  1 .  MRAC  architecture  with  PCH 
Formation  Control  Formulation 


Consider  a  group  of  N  vehicles  whose  individual 
dynamics  are  given  by, 

*(=//(*;>«()>  1  =  1,2,...,  N  (16) 

where  xt  represents  the  states  and  ut  the  control 

vector  of  the  i'h  vehicle.  Assume  that  vehicles  i  and 
j  cooperate  by  regulating  a  joint  variable  (e.g.,  FOS 
range) 

z  =  g{xi,xj)  (17) 


Vehicle  i  s  inverting  solution  is  augmented  with  a 
NN  that  estimates  and  approximately  cancels  A, .  The 

input  vector  to  the  NN  for  the  i'h  vehicle  is  given  by 
Mi  =[xi,ui,zd(t)]T ,  where  zd  (?)  is  a  vector  of 
sufficiently  large  number  of  delayed  values  of 
z\t )  '  .  So,  the  decentralized  control  solution  of  all 

cooperating  aircraft  is  given  by  m(  =  g“1(vi ,  z,  x, ), 
where  v,  is  constructed  as  in  equation  (7). 

APPLIC  ATION  TO  FORMATION  CONTROL 

We  apply  the  approach  described  in  the  previous 
section  to  construct  an  adaptive  formation  controller. 
But  first  we  discuss  some  modeling  issues.  The 
formation  of  vehicles  is  constrained  to  lie  in  a  two- 
dimensional  plane.  The  vehicles  are  considered  to  be 
point-mass  objects  that  can  accelerate  both  along  and 
perpendicular  to  the  direction  of  motion.  In  the 
following  section,  the  equations  of  motion  for  a  single 
vehicle  are  developed. 

Vehicle  Dynamics 

Consider  the  equations  of  motion  of  an  aircraft  in  the 
horizontal  plane. 


whose  relative  degree  (r)  is  known,  so  that. 
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L 


where  a,  =  Jn2  - 1  and  a ,  =  f  —  ) 

[wj 


represent  the 


non-dimensionalized  controls.  The  drag  D  is  given  by 
D 
by 


1  , 

D  =  —  pSV  CD  .  The  drag  coefficient  CD  is  given 


Figure  2.  Banked  Horizontal  Turn 


r  =  C 

D  ^  Dn 


+  kC f 


(28) 


With  reference  to  figure  2,  the  equations  of  motion 
are  given  by, 

mV ij/  =  Lsin  <j>  (21) 

mV  =  T—D  (22) 

mg  =  L  cos  </>  (23) 


where  CDn  is  the  profile  drag  coefficient,  assumed  to 

be  constant,  CL  is  the  lift  coefficient  and  kC2l  = 
induced  drag. 

Equation  (28)  can  be  modified  to  replace  the  lift 
coefficient  with  the  load  factor  as  follows: 


where  if/,V  represent  the  heading  and  speed  of  the 
aircraft  with  respect  to  an  inertial  frame  that  will  be 
specified  later,  m,  <j>  represent  the  mass  and  bank 
angle,  L,  T,  D  represent  the  lift,  thrust  and  drag 
forces  on  the  aircraft  and  g  is  the  acceleration  due  to 
gravity. 


L  pSV-C,  „  2  nW 

n  =  —  = -  =>  CL  = - - . 

W  2 W  pSV 2 


( pscDv2) 

,2 

V  + 

'  2  kW  2 

(  2  'N 

n 

2  W 

V  / 

l psv. o) 

V'2 
\v  y 

Eliminating  <f>  fromeqs.  (21)  and  (23), 


v= 

Un2- 1 

V 

(24) 

(  T-D\ 

V  = 

TI,  L? 

(25) 

1  w  J 

where  n  =  —  is  the  load-factor  of  the  aircraft. 

[w] 

Equations  (24)  and  (25)  can  be  non-dimensionalized 


by  letting  t  =  t 


KRo; 


and  V  =  V  (Vn )  represent  non- 


dimensional  time  and  speed,  where  Va  and  Ra  are 
constant  quantities  with  units  of  speed  and  distance 
respectively.  A  way  to  choose  V o  and  Ro  would  be  to 

set  V2  =  gR0  .  Furthermore,  we  set  Vo  =  Ro . 

=>  Va  =  R0  =  g  .  This  implies, 


dip  _  at 
dt  V 

dV'  (  D' 1 

dt'  ~Ul  [w  J 


(26) 

(27) 


—  =  k.V1  +  k2 
W 


f  2  , 

a,  +1 


V 


(29) 


where  a  1 


■\l n  2  ~  \  ,  k12  are  non-dimensional 


constants  and  V  is  the  non-dimensionalized  velocity. 
So  the  non-dimensional  velocity  equation  is  given  by 
substituting  for  (29)  in  (27): 


dV  ,  ,2 

— -  =  a2~klV  —  kr, 
dt 


f  2  ,  ,\ 

a,  +1 


V 


(30) 


Limits  on  Controls 


Since  the  controls  used  in  our  formulation  are 
physical  quantities,  we  need  to  impose  realistic 
bounds  on  their  values.  From  (21),  (23)  and  (24),  we 
can  write 

ip  =  —  tan  (!)  =  — -\/n2  -1 
V  V 

=>  tan  (j)  =  sin2  - 1  =  a, . 

If  we  set  the  maximum  bank  angle  =  60" ,  we 
have  nmax  =  2  .  At  low  speeds,  the  maximum  load 
factor  is  limited  by  the  stall-limit. 24  At  low-speeds. 


6 

American  Institute  of  Aeronautics  and  Astronautics 


iw  fV2sc. 


Anax 


w 


2  W 


(31) 


Vj  sin(^  -  A- )-Vi  sin(yr  -  A- ) 
A;  = - - - 


(39) 


The  minimum  value  of  n  max  is  set  to  1  and 
corresponds  to  straight,  level  flight  when  the  lift  force 
equals  the  weight.  The  limits  for  control  a,  are  given 
below. 


<*im  =min(tan(Z>max,^H^-l 


Anlin  =  max(^-  tan  ,-^x  - 1 J 


(32) 


The  minimum  value  of  «max  is  associated  with  a 
minimum  speed  given  by 


(33) 


At  this  speed,  no  turning  is  possible.  A  similar 
method  can  be  used  to  determine  the  bounds  on  a , .  It 
is  assumed  that  the  maximum  and  minimum  values  of 
a 2  do  not  vary  with  speed  and  are  equal  to  ±0.2  . 
Note  that  drag  equals  the  maximum  value  of  a2  at 
two  speeds.  The  lower  of  these  speeds  is  set  to  V'min  2  . 
A  dynamic  lower  limit  is  imposed  on  a2  to  prevent 
the  speed  from  going  below  a  minimum  value  given 

by 

L  =max(yirinl,yirin2)  (34) 


The  dynamic  lower  limit  equals  the  drag  force  at 
speeds  marginally  greater  than  Vmln  and  switches  to 
equal  -0.2  at  higher  speeds. 

We  model  the  actuator  system  as  a  saturation  element 
with  limits  described  above 

M=sat  (ucmd)  (35) 


Relative  Kinematics 


where  (x;,y;)  represent  the  Cartesian  position 
coordinates  of  each  aircraft  with  respect  to  the  inertial 
frame  as  shown  in  figure  (3).  Equations  (38)  and  (39) 
constitute  the  relevant  kinematical  equations  for 
formation  flight. 


The  information  available  to  aircraft  i  include:  Vt . 
(by  use  of  an  inertial  measuring  unit  IMU),  Rt] ,  A:/ 
(through  vision-based  sensors)  and  the  control  signals 

an , aa ■ 

Inverting  Formation  Control  Design 

We  design  an  inverting  controller  augmented  with  a 
NN  for  aircraft  i  for  regulating  the  LOS  range  RtJ 
with  respect  to  aircraft  j  .  The  controller  architecture 
is  as  shown  in  figure  1.  The  relative  degree  of  Rt] 
with  respect  to  the  speed  and  heading  of  aircraft  i  is  1 . 
Hence  the  range  command  Rcom  is  filtered  through  a 
first  order  reference  model.  Figure  4  shows  the 
hedged  reference  model.  A  rate  limit  is  introduced  so 
that  the  reference  model  does  not  command  large 
range  rates  when  the  range  error  is  large.  The 
parameter  p  is  the  time  constant  and  is  a  design 
parameter. 


The  kinematics  of  the  ith  aircraft  in  non-dimensional 
form  are  given  by 

x,  =  C,  cos  y/t  (36) 

Ji  —  Vj  sin  Yt  (37) 

R,i  =  Vj  cos {y/j  -  A- )-  Vi  cosfo  -  A- )  (38) 


Figure  4.  Hedged  Reference  Model 
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(41) 


The  dynamic  compensator  portion  of  the  pseudo¬ 
control  is  a  proportional  error  controller, 
v  dc  =  k  p  [Rc  —Rjj).  The  pseudo-control  signal  is 

Rij=Rc+kp{Rc-R^-vad. 

Equation  (38)  shows  that  the  LOS  range  dynamics 
depends  upon  the  speed  and  heading  of  both  the 
vehicles.  We  assume  that  aircraft  i  does  not  know  or 
have  access  to  the  state  and  control  policy  of  aircraft 
j.  So,  the  approximate  model  used  by  aircraft  i  for 
dynamic  inversion  is  given  by 

Ry  =  —VFCi  cos (i/fFCi  —  Ay )  (40) 

Equation  (40)  says  that  aircraft  i  assumes  aircraft  j 
is  stationary.  An  alternate  way  of  assuming  an 
approximate  model  for  aircraft  i  would  be  to  assume 
that  aircraft  j  has  the  same  component  of  speed 
along  and  perpendicular  to  the  LOS  as  aircraft  i . 

Equation  (40)  is  to  be  inverted  (solved)  for  the 
appropriate  speed  and  heading  command  signals 
{VFCi ,  Yfo  )  ■  This  is  done  as  follows.  We  assume  that 
each  vehicle  has  a  body-fixed  frame  attached  to  its 
center  of  mass.  The  x-axis  of  this  body  frame  is 
always  oriented  along  the  direction  of  the  velocity 

vector.  Note  that  -  R;J  represents  the  commanded 

velocity  vector  for  aircraft  i  oriented  along  the  LOS 
from  aircraft  i  to  aircraft  j  .  Figure  5  gives  a 
visualization  of  the  key  variables  from  the  perspective 
of  aircraft  i. 


Figure  5.  Pseudo-control  Vector  in  Body-Fixed 
Frame 

We  write  the  pseudo-control  vector  as  the  sum  of  its 
current  velocity  vector  and  a  perturbation  velocity 
vector.  We  represent  vectors  with  boldfaced  symbols. 


-/?,  =  Vl+AVcmdlj=V: 

Here,  V FCi  represents  the  velocity  vector  command 
for  formation  control.  In  case,  aircraft  i  is  regulating 
LOS  range  with  respect  to  multiple  ( N  >  l)  aircraft, 
equation  (41)  is  modified  as  follows 

N 

*  I>„  R„  =  Vrc,  (42) 


Equation  (42)  shows  that  the  commanded  velocity 
vector  is  a  weighted  average  of  the  pseudo-control 
vectors  associated  with  multiple  neighboring  aircraft. 
The  weights  wit  are  chosen  such  that  0  <  vv,(  <  1, 


N 


Y,  Wy  =  1 .  A  simple  case  results  when  wrl 


1 

N  ' 


=>y«*  =  "7  Z4  (43) 

j=Uj*i 

Equation  (43)  implies  that  aircraft  i  tries  to  meet  the 
LOS  range  requirement  equally  with  respect  to  all  the 
neighboring  aircraft.  Speed  and  heading  commands 
are  generated  from  equation  (43)  as  follows: 

VFa  cos 0FCi  =-—  Y,  R,j  cos 6E 
^  i= hi*i 

1  w  ■ 

VFCi  sin  dFCl  =-—  sin0ij 

N  j=Uj*i 

where  0,.  =  —  yi .  The  above  equations  are  solved 

for  (v Fa ,  y/ Fa ) , 


v  = _ 

FQ  n 


Y  ( 


Yk‘Jcosd-j  1  X R' sin  (Ji 


(44) 


and  iff FCi  —  y/ j  +0fa 

(  n 


(45) 


where  6 Fa  =  tan  1 


Yk  suU) 


MJ*i 


N 


-  Yk‘Jcos0v 
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Hedge  Signals 


For  the  case  with  adaptation,  the  reference  model  for 
the  formation  controller  in  each  vehicle  is  hedged 
whenever  the  commanded  pseudo-control  is  not  equal 
to  the  achieved  pseudo-control.  The  hedge  signals  for 
aircraft  i  are  given  by 

vi„j  =R,j  ~v,  cos(^,-'0  (46> 

Note  that  Vhlj  =£  0  in  three  cases: 

i)  the  actuators  of  aircraft  i  are  saturated, 

ii)  aircraft  i  is  controlling  separation  simultaneously 
from  multiple  (N  >  l)  neighboring  aircraft  and  the 
commanded  velocity  vector  is  given  by  equation  (43). 

iii)  Priority  is  temporarily  given  to  obstacle 
avoidance.  In  this  case,  c,  =£0  (see  the  next  section) 
and  the  commanded  velocity  vector  is  given  by 
equation  (47). 

Static  Obstacle  Avoidance  Controller 

The  controller  design  strategy  for  static  obstacle 
avoidance  is  based  on  a  reactive  ‘steer  towards 
silhouette  edge’  approach.  19  The  idea  is  to  project  the 
shape  of  nearby  obstacles  onto  the  local,  body-fixed 
frame  of  the  vehicle.  If  this  projected  shape,  adjusted 
(enlarged)  to  allow  for  the  size  of  the  vehicle  and  the 
required  ‘clearance’  distance,  surrounds  the  origin  of 
the  vehicle’s  body-fixed  frame,  then  some  portion  of 
the  obstacle  is  dead  ahead  (see  figure  6).  The  vehicle 
must  steer  away  to  avoid  a  collision,  and  the  most 
efficient  direction  to  turn  is  toward  that  portion  of  the 
projected  shape  that  is  closest  to  the  origin. 

To  illustrate  the  concept,  it  is  assumed  that  the 
obstacles  are  contained  within  bounding  spheres 
(circles  in  2  dimensions),  and  that  the  centers 
(Xo,Yo)  and  radii  (r)  of  the  obstacles  are  known. 

The  goal  of  this  strategy  is  to  keep  an  imaginary  line 
La  of  length  Da  ,  originating  at  the  vehicle’s  current 
position  and  extending  in  the  direction  of  the  velocity 
vector,  from  intersecting  with  any  obstacle  boundary.6 
The  length  of  this  line  is  typically  based  upon  the 
vehicle’s  speed  and  maneuverability.  An  obstacle 
further  away  than  this  length  Do  is  not  an  immediate 
threat.  The  obstacle  avoidance  behavior  considers 
each  obstacle  in  turn  and  determines  if  they  intersect 
with  La  .  This  intersection  calculation  is  done  by 
checking  if  (see  figure  6) 


i)  sgn(y,)*sgn(;y2) 

ii)  =  0  or  y2  =  0  . 


Figure  6.  Obstacle  Avoidance 

If  either  of  the  above  two  conditions  is  true,  then 
there  happens  to  be  a  line-obstacle  intersection.  The 
obstacle  which  intersects  La  nearest  the  aircraft  is 
selected  as  the  “most  threatening”  and  corrective 
steering  action  is  undertaken  to  avoid  this  obstacle.  If 
no  obstacle  collision  is  imminent,  no  steering  action 
is  taken. 

Note  that  jj  and  y2  are  the  extremities  of  the 
projected  edge  of  the  obstacle  in  the  vehicle’s  body- 
frame  and  are  given  by  yl2  =Yo  ±  r  ,  where 

Yo  =  ~{Xo  -  X,.  )sin  y/l  +  (l Yo  -  yt  )cos  yrt , 

is  the  y-coordinate  of  the  center  of  the  obstacle  as 

seen  in  aircraft  i  ’s  body-frame. 

Corrective  steering  action  to  avoid  an  obstacle 
involves  a  speed  and  heading  change  command.  The 
heading  change  command  A  \j/OA  is  towards  the 
closest  projected  edge  of  the  obstacle  as  shown  in 
figure  6.  The  speed  change  command  is  given  by 
k 

AVoa  = - — ,  where 

d  obs 

dobs  =  yj{X0-Xi)2  +(T0-y,-)2  ~r  is  the  distance  of 
the  aircraft  from  the  obstacle  and  kv  >  0  is  a 
constant.  The  speed  command  for  obstacle  avoidance 
is  given  by  V0A  =  V  +  AVoa  . 

Composite  Speed  and  Heading  Commands 

Maintaining  a  desired  formation  in  the  presence  of 
obstacles  may  not  be  feasible.  However,  it  is  not 
desirable  that  each  vehicle  abandon  the  group  by 
taking  corrective  action  upon  detecting  an  imminent 
collision.  By  blending  the  commands  coming  from 
the  obstacle  avoidance  controller  and  the  formation 
controller,  a  compromise  between  two  possibly 
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conflicting  commands  can  be  achieved.  The  weights 
used  for  blending  must  be  chosen  carefully  in  order  to 
guarantee  safe  and  coordinated  group  motion. 

The  composite  velocity  vector  command  is  given  by 

Vem*  =  clV0AI+{l-c1)VFa  (47) 

which  in  turn  implies  the  following  speed  and 
heading  commands 


We  consider  a  team  of  3  aircraft  flying  in  formation 
in  a  2  dimensional  environment  with  obstacles. 
Aircraft  1  is  the  team  leader.  It  sets  the  trajectory  for 
the  formation  by  commanding  a  sequence  of  heading 
changes  at  specified  time  intervals  while  commanding 
constant  speed.  In  addition,  it  also  cooperates  with 
the  follower  aircraft  2  and  3  in  regulating  desired 
LOS  separation  from  them.  The  velocity  vector 
command  for  the  leader  aircraft  is  given  by. 


Vcmdi  ~  yj^XcrmU  +  ^1 


Vcm*  =  V,  +  tan’ 


2 

Ycmdi 

Vy„ 


V  ^ Xcmdi  J 


(48) 


where 

V Xcmdi  =  (C1  VoAi  COS  ^  V OAi  +  (l  —  Cl  )^FCi  COS  &FCi  ) 
V Ycmdi  =  (CiKm;  S'n  ^  VoAi  +  (l  —  C1  )^FCi  S^n  @ FCi  ) 


The  weight  c,  is  chosen  to  reflect  the  fact  that 
obstacle  avoidance  has  higher  priority  than  the  need 
for  formation  keeping.  4 

If  dobs  <  3  Da 


/ 

(a  t\\\ 

d0bs 

V 

{  D'o  )) 

else  (50) 

cl  =  0 

end 


Da  is  a  scaling  factor.  Note  that  0  <  c,  <  1 . 
Inner-Loop  Controller 

The  inner-loop  controller  generates  actuator 
commands  to  achieve  the  speed  and  heading 
commands. 

ancmd  =  Kv  {Vcmdi  -  Vf )+  knV,2  +  + 1) 

*  i  W 

«,1  cmd=Kw{^cmdi-V,)Vl 

Note  that  the  actuators  are  subjected  to  a  limit  as 
defined  eq.  (35),  and  the  discussion  that  precedes  it. 


SIMULATIONS 


Yc,nd\  ~  ClV()A\  +  C2  Cl)YLi+{l  C2(l  c  FCl 

where  VLl  refers  to  the  to  the  leader  portion  of  the 
total  aircraft  1  command.  We  set  c2  =  0.7  .  Each 
follower  aircraft  regulates  LOS  separation  from  the 
leader  and  from  each  other.  The  LOS  separations  are 
the  jointly  regulated  variables  since  the  aircraft 
mutually  regulate  them  to  specified  values.  In 
addition,  the  3  aircraft  are  also  commanded  to  avoid 
obstacles  during  their  motion.  We  present  results  for 
cases  with  adaptation  (NN  on)  and  without  adaptation 
(NN  off).  Hedging  is  on  (H  on)  only  for  NN  on. 

Figure  7  shows  the  trajectory  plot  of  the  maneuver 
with  NN  off.  The  trajectory  is  marked  every  2.5 
seconds.  Aircraft  1  starts  at  (0,0),  aircraft  2  at  (5,  -8) 
and  aircraft  3  at  (-5,  -8).  The  filled  circles  represent 
the  obstacles.  Figure  8  shows  the  trajectory  plot  of  the 
maneuver  with  NN  on.  Note  that  with  NN  on,  aircraft 
3  does  not  go  around  an  obstacle  in  the  first  few 
seconds  of  the  maneuver.  This  illustrates  that 
adaptation  helps  the  aircraft  to  make  a  better  choice 
between  formation  keeping  and  obstacle  avoidance. 
Also,  note  the  size  of  the  box  that  the  aircraft  fly  in 
figure  8  as  compared  to  figure  7.  It  is  clear  that 
cooperative  formation  flight  is  enhanced  with  NN  on. 

In  figures  9-12  we  show  results  pertaining  to  aircraft 
2.  The  results  for  the  other  aircraft  are  similar.  Figure 
9  shows  the  reference  signal  tracking  with  both  NN 
off  and  NN  on.  Note  that  there  is  significant  hedging 
of  the  reference  command  with  NN  on,  and  that  the 
tracking  is  significantly  improved  in  comparison  to 
the  NN  off  result. 

Figure  10  shows  the  control  histories  for  aircraft  2. 
The  controls  are  frequently  saturated  during  the 
maneuver.  Figure  1 1  shows  the  time  histories  of 
weight  c,  for  aircraft  2.  The  plot  shows  that  in  the 
first  15  seconds,  when  the  aircraft  is  near  an  obstacle, 
it  spends  much  less  time  avoiding  obstacles  with  NN 
on.  Figure  12  compares  the  inversion  error  with  the 
NN  output  of  aircraft  2. 
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Figure  7.  Trajectory  (NN  off) 


Figure  10.  Control  Histories  (Aircraft  2) 


Figure  8.  Trajectory  (NN  on) 


Figure  11.  c,  History  (Aircraft  2) 


Figure  9.  Reference  Signal  Tracking  (Aircraft  2) 


Figure  12.  Inversion  Error  Approximation  (Aircraft 
2) 
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CONCLUSIONS 


We  have  formulated  a  decentralized  adaptive 
guidance  strategy  that  enables  safe  and  coordinated 
motion  of  a  group  of  unmanned  vehicles  in  an 
environment  with  known  obstacles.  The  strategy 
blends  the  outputs  of  multiple  adaptive  controllers 
with  a  reactive  obstacle  avoidance  controller.  There 
are  two  key  aspects  essential  to  the  adaptation 
process.  First,  an  output  feedback  formulation  is 
employed  that  is  robust  to  unmodeled  dynamics. 
Second,  pseudo-control  hedging  is  needed  to  account 
for  actuator  saturation,  obstacle  avoidance  and  the 
fact  that  separations  from  multiple  aircraft  cannot  be 
simultaneously  controlled.  The  adaptive  controller  in 
each  vehicle  utilizes  local  information  and  LOS 
measurements  obtained  with  vision-based  sensors. 
This  enables  each  vehicle  to  correct  for  the  unknown 
relative  motion  of  neighboring  vehicles.  The 
decentralized  nature  of  the  controller  permits  scaling 
to  any  number  of  vehicles  in  a  formation. 
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